Comparison of SRP-PHAT and Multiband-PoPi Algorithms for Speaker Localization Using Particle Filters
Abstract
Localizing single and multiple concurrent speakers in a reverberant environment with background noise poses several problems. One of the major problems is the severe corruption of the frame-wise localization estimates. To improve the overall localization accuracy, we propose a particle-filter-based tracking algorithm that uses the recently proposed Multiband Joint Position-Pitch (M-PoPi) localization algorithm as a frame-wise likelihood estimate. To evaluate the performance of our approach, we tested it on real-world recordings of seven different speakers and of up to three concurrent speakers. We compared our new approach with a tracker that uses the well-known SRP-PHAT algorithm as its frame-wise likelihood estimate. Finally, we compared both particle-filter-based tracking algorithms with their underlying frame-wise localization algorithms. The M-PoPi-based particle filter tracking algorithm outperforms the SRP-PHAT-based particle filter tracking algorithm. The comparison with the frame-wise localization algorithms shows that this improved performance stems from the more robust M-PoPi frame-wise localization estimate.
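The tracking idea in the abstract, a particle filter driven by a frame-wise likelihood, can be illustrated with a minimal sketch. This is not the authors' implementation: the random-walk motion model, the multinomial resampling, the single-speaker setup, and the synthetic `toy_likelihood` (a stand-in for a real SRP-PHAT or M-PoPi output) are all assumptions made for illustration, as are all function and parameter names.

```python
import math
import random


def wrap_deg(angle):
    """Wrap an angle in degrees to the interval [0, 360)."""
    return angle % 360.0


def circular_mean_deg(angles, weights):
    """Weighted circular mean of angles given in degrees."""
    s = sum(w * math.sin(math.radians(a)) for a, w in zip(angles, weights))
    c = sum(w * math.cos(math.radians(a)) for a, w in zip(angles, weights))
    return wrap_deg(math.degrees(math.atan2(s, c)))


def particle_filter_step(particles, likelihood, rng, motion_std=5.0):
    """One predict / weight / resample step of a bootstrap particle filter.

    particles  : list of azimuth hypotheses in degrees
    likelihood : callable mapping an azimuth to a non-negative score for the
                 current frame (hypothetical stand-in for SRP-PHAT or M-PoPi)
    """
    # Predict: random-walk motion model (an assumption of this sketch).
    predicted = [wrap_deg(p + rng.gauss(0.0, motion_std)) for p in particles]

    # Weight: evaluate the frame-wise likelihood at each hypothesis.
    weights = [likelihood(p) for p in predicted]
    total = sum(weights)
    if total <= 0.0:
        # Uninformative frame: fall back to uniform weights.
        weights = [1.0 / len(predicted)] * len(predicted)
    else:
        weights = [w / total for w in weights]

    # Estimate: weighted circular mean of the predicted particles.
    estimate = circular_mean_deg(predicted, weights)

    # Resample: multinomial resampling proportional to the weights.
    resampled = rng.choices(predicted, weights=weights, k=len(predicted))
    return resampled, estimate


def toy_likelihood(true_azimuth, width=10.0, floor=0.05):
    """Synthetic frame likelihood: a bump at the true azimuth plus a floor."""
    def score(azimuth):
        diff = (azimuth - true_azimuth + 180.0) % 360.0 - 180.0
        return floor + math.exp(-0.5 * (diff / width) ** 2)
    return score


def track(true_azimuths, n_particles=500, seed=0):
    """Run the filter over a sequence of frames and return the estimates."""
    rng = random.Random(seed)
    particles = [rng.uniform(0.0, 360.0) for _ in range(n_particles)]
    estimates = []
    for azimuth in true_azimuths:
        frame_likelihood = toy_likelihood(azimuth)
        particles, est = particle_filter_step(particles, frame_likelihood, rng)
        estimates.append(est)
    return estimates
```

Calling `track([90.0] * 30)` runs 30 synthetic frames with a stationary source at 90 degrees. In a real system, `toy_likelihood` would be replaced by the per-frame output of the chosen localization algorithm, which is the only component that differs between the two trackers compared in the abstract.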
Similar resources
Experimental evaluation of multi-band position-pitch estimation (M-PoPi) algorithm for multi-speaker localization
This paper proposes an enhancement and evaluates the performance of the joint position and pitch estimation (PoPi) algorithm for speaker localization. A modification in the algorithm is introduced in order to improve the performance under high reverberation levels. The performance of the proposed method is evaluated by measuring the correct estimate of position at a frame level. This evaluation...
Concurrent speaker localization using multi-band position-pitch (M-PoPi) algorithm with spectro-temporal pre-processing
Accurate, microphone-based speaker localization in real-world environments, like office spaces or meeting rooms, must be able to track a single speaker and multiple concurrent speakers in the presence of reverberations and background noise. Our Multiband Joint Position-Pitch (M-PoPi) algorithm for circular microphone arrays already shows a frame-wise localization estimation score of about 95% f...
Fast and Robust Realtime Speaker Tracking Using Multichannel Audio and a Particle Filter
In this work, a method to track the azimuth (horizontal angle) of multiple speakers in a typically reverberant real office environment is presented. The steered-response-power algorithm (SRP-PHAT) or the recently published joint position and pitch extraction approach (PoPi), combined with a sequential Monte Carlo estimation, leads to a robust and fast tracker for audio indexing. One intention of...
Robust cross-correlation-based methods for sound-source localization and separation using a large-aperture microphone array
Abstract of “Robust cross-correlation-based methods for sound-source localization and separation using a large-aperture microphone array,” by Hoang T. Do, Ph.D., Brown University, May 2011. Microphone arrays have been used in many applications, such as teleconferencing, speech recognition, talker characterization, speech enhancement, and source localization and separation. Despite the fast-paced develo...
Speaker Localization, tracking and remote speech pickup in a conference room
Effective speech communication using microphone arrays is receiving significant research attention, particularly in speech acquisition methods such as speaker localization and tracking. Localization techniques play an important role for automatic cameras in videoconference systems and for other human-machine interfaces. To accurately estimate the Direction Of Arrival (DOA) of the source, it is necessary to design a suitable ...
Publication date: 2010